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Evaluation of brief screening tools for neurocognitive 
impairment in HIV/AIDS: a systematic review of 

the literature 

Amy R. Zipursky a,b , David Gogolishvili 3 , Sergio Rueda a,b , 
Jason Brunetta c , Adriana Carvalhal b,d , Jennifer A. McCombe e , 
M. John Gill e , Anita Rachlis b ' f , Ron Rosenes 3 , Gordon Arbess M , 
Thomas Marcotte 8 and Sean B. Rourke a,b,d 

Objective(s): To systematically review literature on brief screening tools used to 
detect and differentiate between normal cognition and neurocognitive impairment 
and HIV-associated neurocognitive disorders (HANDs) in adult populations of persons 
with HIV. 

Design: A formal systematic review. 

Methods: We searched six electronic databases in 2011 and contacted experts to 
identify relevant studies published through May 2012. We selected empirical studies 
that focused on evaluating brief screening tools (<20min) for neurocognitive impair- 
ment in persons with HIV. Two reviewers independently reviewed retrieved literature 
for potential relevance and methodological quality. Meta-analyses were completed on 
screening tools that had sufficient data. 

Results: Fifty-one studies met inclusion criteria; we focused on 31 studies that com- 
pared brief screening tools with reference tests. Within these 31 studies, 39 tools were 
evaluated and 67% used a comprehensive neuropsychological battery as a reference. 
The majority of these studies evaluated HIV-associated dementia (HAD). Meta-analyses 
demonstrated that the HIV Dementia Scale (HDS) has poor pooled sensitivity (0.48) and 
the International HIV Dementia Scale (IHDS) has moderate pooled sensitivity (0.62) in 
detecting a range of cognitive impairment. Five newer screening tools had relatively 
good sensitivities (>0.70); however, none of the tools differentiated HAND conditions 
well enough to suggest broader use. There were significant methodological short- 
comings noted in most studies. 

Conclusion: HDS and IHDS perform well to screen for HAD but poorly for milder 
HAND conditions. Further investigation, with improved methodology, is required to 
understand the utility of newer screening tools for HAND; further tools may need to be 
developed for milder HAND conditions. 
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Introduction 



Since the introduction of combination antiretroviral 
therapy (cART), the incidence of severe forms of 
HIV-associated neurocognitive impairment has declined 
significantly, whereas the prevalence of the milder forms 
has increased [1,2]. The nomenclature originally devel- 
oped by the American Academy of Neurology Task Force 
on AIDS in 1991 was recendy updated in 2007 in response 
to the change in presentation and research available on the 
natural history of HIV-associated neurocognitive compli- 
cations [3]. There are now three different HIV-associated 
neurocognitive disorder (HAND) conditions. In order 
of increasing severity and related impact on everyday 
functioning, these are first, asymptomatic neurocognitive 
disorder (ANI) with a prevalence of 30—35% and 
characterized by at least mild neurocognitive impairment 
in two domains (at least 1 .0 SD below the mean for age- 
education-appropriate norms) but with no evidence of 
difficulty with day-to-day functioning; second, mild 
neurocognitive disorder (MND) with a prevalence of 
20-25% and characterized by at least mild neurocognitive 
impairment in two domains (at least 1.0 SD below the 
mean for age— education appropriate norms) and with 
mild interference with day-to-day functioning; and third, 
HIV-associated dementia (HAD) with a prevalence of 
2—3% with generally moderate to severe impairments in 
neurocognitive functioning in multiple domains (2 SD or 
greater than demographically corrected means) and 
marked difficulties with everyday functioning [3] . Overall, 
the prevalence of HAND based on the new criteria is 
approximately 50-60% [1,4]. 

In the early years of the HIV epidemic, 20—30% of 
patients presented with severe cognitive impairment that 
was later classified as HIV- 1 -associated dementia complex 
[5]. The HIV Dementia Scale (HDS) was developed to 
screen for HAD [6] and later the International HDS 
(IHDS) [7] was developed for use in global settings. 
These instruments have had varying degrees of success 
in screening of those with the milder forms of HAND 
(i.e. ANI and MND). The clinical significance of 
identifying the milder forms of HAND is important, as 
they can have a significant impact on the lives of people 
living with HIV. They have been shown to interfere 
with medication adherence [8,9], workplace perform- 
ance [10], driving [11] and ability to carry out tasks 
independently [12,13]. The consequences of these 
effects have been seen in decreased health-related 
quality of life [14] and increased mortality rates [2,15] in 
people with HAND. 

To better assist in prevention, treatment and management 
of HAND, it is important that screening tools be 
developed and evaluated. Before widespread use for 
clinical treatment decision-making, these tools need to be 
shown to have adequate sensitivity and specificity with 
HAND; they need to be brief enough to be used in the 



clinic or health centre, and able to be administered by a 
trained individual with minimal equipment. 

To determine which screening tools for HAND meet 
these criteria, we have undertaken the first systematic 
review of the literature with respect to the screening tools 
for HAND. Our objective is to systematically review 
the literature on brief screening tools that can detect 
neurocognitive impairment and are able to differentiate 
among the various forms of HAND in the adolescent and 
adult population of persons living with HIV and AIDS. 



Materials and methods 



We conducted literature searches in MEDLINE, 
EMBASE, PsycINFO, LILACS and CINAHL in May 
2011 for published, full-length research articles. An 
experienced librarian in systematic review methods 
assisted us in developing a search strategy (included in 
Appendix A, http://links.lww.com/QAD/A370). We 
did not apply date or language restrictions in the search 
strategy because we did not want to limit the investigation 
to tools administered only in English. We also contacted 
key experts in the field for recommendations and received 
articles for potential inclusion until the start of May 2012. 

Inclusion criteria 

Participants 

Only studies reporting on participants who were 
adolescent or adult age were included (generally age 
> 1 6 years) . To be included in the review, individual studies 
needed to include participants who were HIV-positive and 
with some form of documented HIV-associated neuro- 
cognitive impairment, or be characterized/ diagnosed with 
having any HAND [i.e. ANI, MND, minor cognitive 
motor disorder (MCMD), or HAD] [3,5]. The studies 
evaluated neurocognitive impairment in different ways, 
and this is a limitation of the current state of the literature. 

Screening tools 

We considered studies that focused on screening for 
'neurocognitive' or 'neuropsychological' impairments or 
HAND [3,5]. The ways in which studies addressed 
screening tools included evaluating the utility of a specific 
screening tool, comparing different screening tools and 
validating screening tools. The focus of the review was 
then restricted to include only those articles that evaluated 
the utility of screening tool(s) against a separate reference 
or criterion. We defined screening tools to include pen 
and paper tests, algorithms and computerized tests and 
excluded biological tests (i.e. studies looking at bio- 
markers). In addition, we focused on 'brief screening 
tools, and we defined this as those that took approxi- 
mately 20 min or less to perform. This was done because 
the focus was screening tests that might be potentially 
useful in a clinic or healthcare setting to detect HAND. 
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Study designs 

All studies needed to report findings from empirical 
research. We included primary studies and considered 
both cross-sectional and longitudinal investigations. We 
excluded primary studies that used qualitative methods or 
case study methods. We also considered systematic review 
articles. 

Search and data extraction 

Two reviewers independently assessed the titles and 
abstracts of the citations from the search according to the 
selection criteria. For the citations that met the criteria, 
data from full texts were extracted by one reviewer and 
checked independently for accuracy by a second 
reviewer. We contacted study authors to obtain any 
information that was missing. Any disagreements 
between reviewers on data extracted were resolved by 
consensus or by a third reviewer. 

Methodological quality 

Two reviewers independently appraised the quality of the 
selected articles from the search using the Quality 
Assessment of Diagnostic Accuracy Studies (QUADAS) 
tool [16]. This 14-item tool was specifically designed to 
assess the quality of primary studies addressing diagnostic 



accuracy. Disagreements between the reviewers on the 
different QUADAS items were resolved by consensus. 



Results 



There were 2030 citations in the database searches and 
334 of these were duplicates between databases. The 
review of titles and abstracts resulted in the identification 
of 316 articles for full assessment, with 1380 articles being 
excluded because they did not meet our inclusion 
criteria. We completed a full appraisal of 304 articles (we 
were unable to locate 12 full articles despite contacting 
the authors), and at this stage found that 253 of the 304 
did not meet inclusion criteria resulting in 51 studies for 
inclusion. Articles that did not meet inclusion criteria 
were for the following reasons: six did not focus on 
people living with HIV; 185 did not evaluate screening 
tools; 28 did not include screening tools that are brief; 
eight did not report on empirical studies; and 26 articles 
did not include participants with HAND or impairment. 
[See Fig. 1 for Search strategy]. 

Of the 51 studies that met criteria [6,7,17—65], we 
identified two main categories into which these studies 



0 



© 
O 
O 



Titles and abstracts identified 
through database searching 
(n = 2022) 



Studies identified through experts 
(1 = 8) 



References after duplicates removed 
(n= 1696) 



LJJ 



References screened 
(n= 1696) 



References excluded (did 
not meet selection 
criteria) 
(n= 1380) 



1 




Full-text articles excluded 
(did not meet selection 

criteria) 

(n = 253) 
Unable to locate full text 

(n=12) 


Full-text articles assessed 
for eligibility 
(n = 316) 




► 


v 




Studies included in 
qualitative synthesis 
(n = 51) 







-o 

T3 
=! 



Studies included in 
quantitative synthesis 
(meta-analysis) 
(n=11) 



Fig. 1 . Search strategy. 

Copyright © Lippincott Williams & Wilkins. Unauthorized reproduction of this article is prohibited. 



2388 AIDS 2013, Vol 27 No 15 



could be divided: first, studies that evaluated screening 
tools by comparing them with a reference standard or 
criterion (n = 31 of 51 or 61% of studies evaluated) 
[6,7,17—44,64], and second, studies that evaluated 
screening tools by other methods (« = 20 of 51 or 39% 
of studies) [45-63,65]. For the purposes of the current 
article, we decided to focus our systematic review analyses 
on the first category of studies (« = 31), [see Table 1 for 
Characteristics of studies on brief screening tools for 
HIV-associated neurocognitive disorders] as this is the 
most accepted method of evaluating a diagnostic tool 
(Results from the second category of studies can be found 
in Appendix B). 

Study characteristics (/V = 31 studies) 

Thirty-one studies are the focus of this subsequent 
analysis. Thirty of the 31 used a cross-sectional design 
[6,7,17—36,38—44,64] and one used a longitudinal design 
[37]. Eighteen of these studies (60% of all studies 
examined) were conducted in the USA [6,7,17-19,22, 
23,26,27,31,33-38,42,44]; three in Australia [20,21,32]; 
three in South Africa [24,28,41]; one each in the UK 
[25]; Germany [43]; Belgium and the Netherlands [29]; 
Switzerland [40]; Uganda [7]; Kenya [30]; India [64]; and 
one at sites across the world [39] . The participants in all 
of the studies combined totalled a sample of 5837 
participants, and the individual study samples ranged from 
20 [41] to 1549 [39] participants. Twenty-five studies 
used convenience sampling [6,7,17—19,22,24,25,27—33, 
35—41,43,44,64], and two used random sampling 
[20,34]. 

Neurocognitive screening tool evaluations 

Types of screening tools evaluated 
Within these 31 studies, 39 screening tool evaluations 
were conducted with 21 unique screening tools. Eleven 
studies evaluated the HDS (35%) [6,18,19,24,31,34, 
38,40,42-44]; four the IHDS (13%) [7,28,41,64]; three 
the CogState Computerized Battery (10%) [20,32,35]; 
three the Mini-Mental State Examination (MMSE) (10%) 
[6,31,64]; and other tools were evaluated in the remaining 
16 studies [6,17,19,21-23,25-27,29,30,33,36,37,39,64]. 
Twenty-one (68%) [6,7,17-23,26,27,30,32,33,35-39, 
41,42] studies reported evaluating screening tools in 
English and eight (26% of studies overall) [7,24,28-30, 
41,43,44] in another language. 

Presence and severity of neurocognitive impairment 
assessed 

Of the 31 studies, 11 (35%) focused on detecting HAD 
[6,7,18,22,28,30,31,34,40,44,64], four also screened for 
MND [28,30,40,64] or MCMD [18,22,34,44], and five 
screened for ANI [28,30,34,40,64]. Seventeen studies 
(55%) focused on detecting neurocognitive impairments 
and/or deficits [17,18,20-23,25-27,29,33,35,36,38,41, 
42,66]. Eight studies (26%) [6,7,18,19,22,31,34,44] used 
1991 criteria [5] and five studies (16%) [21,28,30,40,64] 
used the revised 2007 criteria [3] for diagnosing HAND. 



It is important to note that many of the studies were 
published prior to the establishment of the 2007 HAND 
criteria. 

Assessment of functional status 

Functional status was measured objectively in nine 
studies (29%) [20-22,30,34,38,39,44,64] (29%) and 
subjectively in seven studies (23%) [23,26,28,29,37, 
40,42]. It is worth noting that functional status was 
assessed differently across studies, and only three of the 
studies that objectively assessed functional status 
[20,29,63] used the recent 2007 Frascati criteria to 
classify participants. 

Length of time of reference test/criterion 
We were interested in determining whether the quality 
and rigour of the criterion or reference test used was 
related to the outcome of the screening tests. We 
examined and coded the length and comprehensiveness 
of the neuropsychological test battery into 'short', 
'medium' and 'long and comprehensive' battery (See 
Table 2 for 'short', 'medium' and 'long' battery criteria). 
Nine studies (29%) used a 'short' [6,23-25,27,33,37, 
41,43], 11 (35%) used a 'medium' [7,22,28-30,32,35, 
38,40,42,44] and seven (23%) used a 'long' [17-21,26,36] 
battery. For the HDS and IHDS, there were sufficient 
data to evaluate the associations between sensitivity and 
specificity and battery size. There was a significant 
association for the HDS between sensitivity and size of 
battery (P<0.05), with sensitivity decreasing with 
increasing battery size. There was a trend for increasing 
specificity with increasing battery size, but this association 
was not significant. 

Type of reference test / criterion and sensitivity 
performance 

Twenty-six of the 39 tool evaluations (67%) used a detailed 
neuropsychological battery as the reference standard 
[18-22,26,28-30,32,35,36,38,40-44,64], Fifteen tool 
evaluations (38%) reported adequate sensitivities 
(>0.75). Adequate sensitivities (>0.75) were reported 
for the HDS [6,24,31,40,44], IHDS [7,41], CogState 
[20], Grooved Pegboard [6], Hopkins Verbal Learning 
Test-Revised/ Grooved Pegboard Test Non-Dominant 
Hand [19], Hopkins Verbal Learning Test/WAIS-III 
Digit Symbol [19], Mental Alternation Test [27], Motor 
Battery [36], the Screening Algorithm [21] and the 
Timed Gait Test [39]. 

Meta-analysis findings 

The HDS and IHDS have been sufficiently studied to 
perform a meta-analysis for each tool. For inclusion in the 
meta-analysis, the articles we reviewed were required to 
use a neuropsychological battery as the reference test; use 
the neuropsychological battery to classify the participants 
as 'impaired'; and report the incidence of impairment, 
sensitivity and specificity of the tools. Seven studies on 
the HDS [6,19,34,38,40,42,44] and three studies on the 
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IHDS met these criteria [7,28,41]. One study on the 
IHDS looked at this tool in two different participant pools 
[7], and as such, the results from each of the participant 
pools were included in the meta-analysis. 
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Meta-analysis and qualitative results with the HIV 
Dementia Scale 

The Forest Plot of the meta-analysis of the HDS (Fig. 2a) 
shows a large range in the sensitivities for the HDS across 
studies; four studies [19,34,38,42] reported poor sensi- 
tivity (i.e. <0.55) and three studies [6,40,44] reported 
good sensitivity (i.e. >0.80). Overall, the pooled 
sensitivity of these studies (i.e. 0.48) would suggest that 
the HDS (without any corrections for demographics) is 
not useful for detecting a range of HAND conditions. 
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The studies that report adequate sensitivities for the HDS 
fall into three main groups. The first group, the study by 
Power et al. [6] (sensitivity 0.81), utilized the HDS to 
detect frank dementia. The second group, studies by 
Simioni et al. [40] and Wojna et al. [44] (sensitivity 0.82 
and 0.87), used a high cut-off score (13 or 14). We meta- 
analysed these two studies together (Fig. 2b) and 
determined that the pooled sensitivity increased greatly 
when only evaluations with higher cut-offs were used 
(0.84). Finally, Morgan et al. [34] used demographic 
corrections for age and education ("P-scores) and increased 
the sensitivity from 0.17 with a cut-off of 10 or less to 
0.71. These results suggest that the utility of the HDS can 
be improved by increasing the cut-off score, or by using 
demographically adjusted T-scores, but only for the 
detection of dementia. 



The two studies with the lowest and comparable 
sensitivities [i.e. 0.09 and 0.17; Preliminary results from 
our CIHR Screening study (« = 20) also produced similar 
results with a sensitivity of 0.12 using a 'long and 
comprehensive' battery (personal communication, S.B.R., 
13 March 2013)] both used a 'long and comprehensive' 
reference battery [19,34], whereas those that used a 
'medium' reference battery [38,40,42,44] had not only 
higher sensitivities but also a larger range of sensitivities 
(0.39—0.87). These discrepancies in results using 'medium' 
versus 'long' reference batteries suggest that the rigour and 
comprehensiveness of the battery may be important 
determinants to consider when evaluting the overall test 
performance of screening tests for HAND. 
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Meta-analysis with the International HIV Dementia 
Scale 

The meta-analysis of the IHDS (Fig. 2c) demonstrated 
more consistency than the HDS, although these 
studies employed only 'small' and 'medium' length 
reference batteries. One [28] study reported a poor 
sensitivity for the IHDS (0.45) and the other three 
[7,41] studies reported good sensitivities (>0.80). 
Overall, the pooled sensitivity of these studies (0.62) 
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(a) 



Power 1995 (small) 
Wojna 2007 (medium) 
Smith 2003 (medium) 
Simioni 2010 (medium) 
Richardson 2005 (medium) 
Carey 2004 (large) 
Morgan 2008 (large) 
Rourke2013 (large) 



Sensitivity (95% CI) 

(0.65 - 0.92) 
(0.66 - 0.97) 
(0.24 - 0.55) 
(0.72 - 0.90) 
(0.32 - 0.77) 
(0.03 - 0.20) 
(0.09 - 0.29) 
(0.01 -0.36) 



0.81 
0.87 
0.39 
0.82 
0.55 
0.09 
0.17 
0.12 



0.2 



0.4 0.6 
Sensitivity 



0.8 



Pooled sensitivity = 0.48 (0.42 to 0.53) 
j Chi-square = 1 46.05; df = 7 (P = 0.0000) 
1 Inconsistency (l-square) = 95.2% 



(b) 



Simioni 2010 (medium) 
Wojna 2007 (medium) 



Sensitivity (95% CI) 

0.82 (0.72 - 0.90) 
0.87 (0.66 - 0.97) 



0.2 0.4 0.6 0.8 

Sensitivity 



Pooled sensitivity = 0.84 (0.75 to 0.90) 
Chi-square = 0.27; df = 1 (P = 0.601 8) 
1 Inconsistency (l-square) = 0.0% 



(c) 



14 



Sensitivity (95% CI) 



Sacktor 2005 USA (small) 
Sacktor 2005 Uganda (small 
Joska 201 1 (medium) 
Singh 2008 (medium) 



0.80 
0.80 
0.45 
0.88 



(0.59 - 0.93) 
(0.59-0.93) 
(0.34 - 0.57) 
(0.62 - 0.98) 



0.2 



0.4 0.6 

Sensitivity 



0.8 



Pooled sensitivity = 0.62 (0.53 to 0.70) 
Chi-square = 22.1 2; df = 3 (P = 0.0001 ) 
Inconsistency (l-square) = 86.4% 



Fig. 2. Forest plots, (a) Utility of the HIV Dementia Scale in detecting HIV-associated neurocognitive disorders, (b) Utility of the 
HIV Dementia Scale with cut-off of 13 or 14. (c) Utility of the International HIV Dementia Scale in detecting HIV-associated 
neurocognitive disorders. CI, confidence interval. 



demonstrates that the IHDS may be moderately useful 
in detecting HAND. 

Quality appraisal 

The quality assessment demonstrated that there are specific 
areas in which the studies tend to have methodological 
shortcomings (Fig. 3). Very few studies fulfilled criteria 
10 and 11: 10, 'Were the index test results interpreted 
without knowledge of the results of the reference 
standard?' (6/31, 19%) [19,22,30,31,41,44] and 11, 'Were 
the reference standard results interpreted without know- 
ledge of the results of the index test?' (7/31, 23%) 
[19,22,30,31,39,41,44]. This low level suggests that there 
may be some introduction of bias in the data interpretation, 
which may result in an overestimation of sensitivity. 

Only about half the studies fulfilled criterion 1 , 'Was the 
spectrum of patients representative of the patients 



who will receive the test in practice?' (17/31, 54%) 
[6,7,17,19,21,22,24,25,27,28,31,32,34,36,37,39,41]. 
This suggests that there are a good number of studies that 
may have limited applicability to real-life clinical settings. 

The quality assessment did highlight some successful 
methodologies that should be continued in further 
evaluations. First, the majority of studies administered 
the same reference test and screening test to all study 
participants [criterion 5, 'Did the whole sample or a 
random selection of the sample receive verification using 
a reference standard of diagnosis?' (30/31, 97%) [6,7, 
17—28,30—44,64] and criterion 6, 'Did patients receive 
the same reference standard regardless of the index test 
result?' (30/31, 97%) [6,7,17-33,35-44,64]]. This 
allowed for proper and comprehensive calculation of 
sensitivity and specificity. Second, the majority of studies 
described their procedures with sufficient detail for 
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Fig. 3. Number of papers fulfilling the QUADAS criteria (1-14). 



replication of criterion 8 - 'Was the executive of the 
index test described in sufficient detail to permit 
replication of the test?' (30/31, 97%) [6,7,17-22, 
24—44,64], enabling future comparison of newer 
screening tools to already evaluated tools. 



Discussion 



With the prevalence of milder forms of HAND 
increasing, and limited resources available for formal 
neuropsychological examinations, there is a critical need 
to be able to screen and identity people with HAND. Our 
current systematic review identified 51 peer-reviewed 
articles that focused on screening for neurocognitive 
impairments associated with HIV/AIDS. Of these, about 
two-thirds (31/51) used a criterion or reference to 
characterize the performance of 21 unique screening 
instruments. About half of the 31 studies examined the 
performance of the HDS or the IHDS as screening tests. 
Fifty-five percent of studies focused on detecting 
neurocognitive impairments, 35% on HAD and 15% 
in identifying MND or ANI. Only 16% of studies 
employed the latest 2007 HAND criteria [3]. 

The results of this systematic review suggest that the HDS 
and IHDS are not ideal tools for identifying a range of 
neurocognitive impairment. From the meta-analysis on 
the HDS and IHDS, we were able to determine that these 
tools have generally poor (0.48) and moderate (0.62) 
pooled sensitivities, respectively, in identifying a range of 
neurocognitive impairment. We also identified a potential 
relationship between the reference test and criterion with 
the range of sensitivities on the HDS and the IHDS, that is 
those studies using the most comprehensive neuropsy- 
chological test battery as the 'gold standard' criterion had 
the lowest (and similar) sensitivity values, whereas those 
that were considered 'short' or 'medium' in time (and 
comprehensiveness) resulted in much larger ranges of 
sensitivities. The association between lower sensitivity 



(ability to identify true positives) and large reference 
battery was significant for the HDS (P<0.05). These 
findings suggest that a longer, more comprehensive 
neuropsychological battery may be a more stringent and 
appropriate reference test. Future studies evaluating sensi- 
tivity may want to consider using a large neuropsycho- 
logical battery as a reference test to classify participants in a 
more discerning way. Studies using the HDS to identify a 
range of neurocognitive impairment reported higher 
sensitivities when higher cut-offs were used [40,44] or 
demographically adjusted T-scores were used [34]. 

The review identified 10 other screening tools with 
adequate sensitivities (>0.75). Of these, four tools or 
combinations of tests were used to detect HAND 
conditions and overall neurocognitive impairment (as 
opposed to impairment in specific domains) and they 
used a 'gold-standard' neuropsychological battery as the 
reference test or criterion. These four tools include the 
CogState [20], the Screening Algorithm [21], the paired 
Hopkins Verbal Learning Test and WAIS-III Digit 
Symbol combination and the paired Hopkins Verbal 
Learning Test and Grooved Pegboard Non-Dominant 
Hand combination [19]. It is important to note that the 
other studies defined neurocognitive impairment in 
different ways and evaluated these conditions using 
different criteria; thus, the comparison of efficacy across 
all studies needs to be carried out with caution given these 
limitations. In addition to these four tests, Becker et at 
[17] reported a slightly lower sensitivity (0.72) for the 
Computer Assessment of Mild Cognitive Impairment 
(CAMCI) against a comprehensive neuropsychological 
battery. These are the five tools identified by our review 
that show some promise to be used to identify HAND in a 
clinical setting, although they warrant further study and 
validation. 

In the 20 studies that met review criteria and are reported 
in Appendix B, http://links.lww.com/QAD/A370, one 
other potentially useful tool, The Montreal Cognitive 
Assessment (MoCA) [48], was highlighted. This tool is 



Copyright © Lippincott Williams & Wilkins. Unauthorized reproduction of this article is prohibited. 



Tools for HIV-associated neurocognitive disorder Zipursky et ai 



2399 



worth note because of its rising popularity in clinical 
settings, although the reliability and validity to detect 
HAND remains to be determined. 

The review also highlighted key methodological issues in 
this body of literature. First, in order to properly 
determine whether a screening tool can detect the 
target condition, it is imperative that the reference test 
properly classifies the condition [16]. Only 19 studies of 
the 31 (61%) [18-22,26,28-30,32,35,36,38,40-44,64] 
used the 'gold-standard' neuropsychological battery as the 
reference test. Furthermore, only seven (23%) studies 
[17—21,26,36] used more comprehensive reference tests 
(>90 min) and functional status was measured objectively 
in only nine studies [20-22,30,34,38,39,44,64] (29%). In 
addition, only five studies [21,28,30,40,64] used the most 
recent 2007 criteria to classify HAND [3] . The QUADAS 
assessment reported that only 17 of the 31 (54%) 
[6,7,17,21,22,24,25,27,28,31,32,34,36,37,39,41] were 
using representative samples, noting a severe limitation 
in the applicability of the studies. For cognitive screening 
tools to be properly evaluated, and for these evaluations 
to inform clinical practice, more stringent methodo- 
logical procedures are needed. 

The results of this systematic review provide insights into 
the performance of the current screening instruments 
available for HAND and identify key methodological 
issues that need to be addressed in future studies. In this 
current context of mild cognitive impairment associated 
with HIV, it is clear that improved screening tools could 
go a long way in improving the care, quality of life and 
study of treatment interventions for individuals living 
with HIV and AIDS. 
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